A Framework for Speech Source Localization Using Sensor Arrays
Author: Michael Shapiro Brandstein
Abstract
Abstract of "A Framework for Speech Source Localization Using Sensor Arrays," by Michael Shapiro Brandstein, Ph.D., Brown University, May 1995
Electronically steerable arrays of microphones have a variety of uses in speech data acquisition systems. Applications include teleconferencing, speech recognition and speaker identification, sound capture in adverse environments, and biomedical devices for the hearing impaired. An array of microphones has a number of advantages over a single-microphone system. It may be electronically aimed to provide a high-quality signal from a desired source location while simultaneously attenuating interfering talkers and ambient noise, does not necessitate local placement of transducers or encumber the talker with a hand-held or head-mounted microphone, and does not require physical movement to alter its direction of reception. Additionally, it has capabilities that a single microphone does not; namely automatic detection, localization, and tracking of active talkers in its receptive area.
A fundamental requirement of sensor array systems is the ability to locate and track a speech source. An accurate fix on the primary talker, as well as knowledge of any interfering talkers or coherent noise sources, is necessary to effectively steer the array. Source location data may also be used for purposes other than beamforming; e.g. aiming a camera in a video-conferencing system. In addition to high accuracy, the location estimator must be capable of a high update rate as well as being computationally non-demanding in order to be useful for real-time tracking and beamforming applications.
This thesis addresses the specific application of source localization algorithms for estimating the position of speech sources in a real room environment given limited computational resources. The theoretical foundations of a speech source localization system are presented. This includes the development of a source-sensor geometry for talkers and sensors in the near-field environment, the evaluation of several error criteria available to the problem, and the detailing of source detection and estimate-error prediction methods. Several practical algorithms necessary for real-time implementation are then developed, specifically the derivation and evaluation of an appropriate time-delay estimator and a novel closed-form locator. Finally, results obtained from several real systems are presented to illustrate the effectiveness of the proposed source localization techniques as well as to confirm the practicality of the theoretical models.
A Framework for Speech Source Localization Using Sensor Arrays, by Michael Shapiro Brandstein (Sc.B., Brown University, 1988; S.M.E.E., Massachusetts Institute of Technology, 1990). Thesis submitted in partial fulfillment of the requirements for the Degree of Doctor of Philosophy in the Division of Engineering at Brown University.
Similar resources
Three Dimensional Localization of an Unknown Target Using Two Heterogeneous Sensors
Heterogeneous wireless sensor networks consist of some different types of sensor nodes deployed in a particular area. Different sensor types can measure different quantity of a source and using the combination of different measurement techniques, the minimum number of necessary sensors is reduced in localization problems. In this paper, we focus on the single source localization in a heterogene...
Direction of Arrival Estimation and Localization Using Acoustic Sensor Arrays
Sound source localization has numerous applications such as detection and localization of mechanical or structural failures in vehicles and buildings or bridges, security systems, collision avoidance, and robotic vision. The paper presents the design of an anechoic chamber, sensor arrays and an analysis of how the data acquired from the sensors could be used for sound source localization and ob...
Calibration errors of uniform linear sensor arrays for DOA estimation: an analysis with SRP-PHAT
This article presents an analysis of the sensitivity of geometrical sensor errors in acoustic source localization using the well-established SRP-PHAT method. The array in this analysis is a uniform linear array and the intended source is human speech in the far field. Two major results are presented: inner-sensor geometrical errors in the linear array produce smaller localization errors than co...
A Practical Methodology for Speech Source Localization With Microphone
Electronically steerable arrays of microphones have a variety of uses in speech data acquisition systems. Applications include teleconferencing, speech recognition and speaker identification, sound capture in adverse environments, and biomedical devices for the hearing impaired. An array of microphones has a number of advantages over a single-microphone system. It may be electronically aimed to ...
Enabling Speech Applications using Ad Hoc Microphone Arrays
Microphone arrays are central players in hands-free speech interface applications. The main duty of a microphone array is capturing distant-talking speech with high quality. A microphone array can acquire the desired speech signals selectively by leading the beampattern towards the desired speaker. The foreseen application of ubiquitous sensing motivated by the abundance of microphone-embedded ...
Publication date: 1995